“Why Corrigibility is Hard, and Important [IABED Resources]” by Raemon
Description
I worked a bunch on the website for If Anyone Builds Its Online Resources. It went through a lot of revisions in the weeks before launch.
There was a particular paragraphs I found important, which I now can't find a link to, and I'm not sure if they got deleted in an edit pass or if they just moved around somewhere I'm failing to search for.
It came after a discussion of corrigibility, and how MIRI made a pretty concerted attempt at solving it, which involved bringing in some quite smart people and talking to people who thought it was obviously "not that hard" to specify a corrigible mind in a toy environment.
The paragraph went (something like, paraphrased from memory):
The technical intuitions we gained from this process, is the real reason for our particularly strong confidence in this problem being hard."
This seemed like a pretty [...]
---
Outline:
(03:21 ) Intelligent (Usually) Implies Incorrigible
(10:42 ) Shutdown Buttons and Corrigibility
(23:42 ) A Closer Look at Before and After
---
First published:
September 30th, 2025
---
Narrated by TYPE III AUDIO.